Automated indexing for making of a newspaper article database.

نویسندگان

چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Indexing of Newspaper Microfilm Images

This paper describes a proposed document analysis system that aims at automatic indexing of digitized images of old newspaper microfilms. This is done by extracting news headlines from microfilm images. The headlines are then converted to machine readable text by OCR to serve as indices to the respective news articles. A major challenge to us is the poor image quality of the microfilm as most i...

متن کامل

Linking article parts for the creation of newspaper digital library

An important issue pertaining to the retro-conversion of newspapers, i.e. the conversion of newspaper issues into digital resources, is the identification and appropriate digital representation of an article. To complete this task, a number of steps have to be followed, from segmentation of the newspaper image to optical character recognition and linking of different items belonging to the same...

متن کامل

Textual Article Clustering in Newspaper Pages

In the analysis of a newspaper page an important step is the clustering of various text blocks into logical units, i.e., into articles. We propose three algorithms based on text processing techniques to cluster articles in newspaper pages. Based on the complexity of the three algorithms and experiment on actual pages from the Italian newspaper L’Adige, we select one of the algorithms as the pre...

متن کامل

Exploitation of Newspaper-article Characteristics for Article Retrieval and Answer Extraction in QAC Task 2

In this paper, we discuss a system for newspaper article retrieval and answer extraction. Due to the rapidly increasing amount of accessible information, systems that allow search in natural language are expected to play a much more important role in the very near future. Our system, called RAIK-Prassie, is designed for TASK 2 of QAC. The design of the RAIK-Prassie system focuses mainly on prac...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Journal of Information Processing and Management

سال: 1989

ISSN: 0021-7298,1347-1597

DOI: 10.1241/johokanri.32.283